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7018610 INSPEC Abstract Number: C2001-10-7250-008 
Title: CLARIT TREC-8 manual ad hoc experiments 

Author(s): Evans, D.A. ; Bennett, J.; Xiang Tong; Huettner, A.; Chengxiang 
Zhai; Stoica, E. 

Conference Title: Information Technology: Eighth Text REtrieval 
Conference (TREC-8) (NIST SP 500-246) p. 335-40 
Editor(s): Voorhees, E.M.; Harman, D.K. 
Publisher: NIST, Gaithersburg, MD, USA 

Publication Date: 2000 Country of Publication: USA xxix+1147 pp. 
Material Identity Number: XX-2001-01522 

Conference Title: Information Technology: Eighth Text REtrieval 

Conference (TREC-8) 

Conference Sponsor: NIST; Defense Adv. Res. Projects Agency 

Conference Date: 16-19 Nov. 1999 Conference Location: Gaithersburg, 

MD, USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Applications (A); Practical (P) 

Abstract: CLARITECH 1 s submission in TREC-7 demonstrated the utility of 
document clustering in retrieval. We continued this work in TREC-8, using a 
clustered document presentation exclusively. We also added significant new 
functionality to the manual ad hoc user interface, integrating it with an 
entity extraction subsystem (upgraded and customized for TREC) . Extracted 
entities represent an alternate set of document features. Our experiments 
suggest that in many cases users might construct more effective queries by 
moving beyond surface terms and drawing from this more abstract pool of 
semantic types. Despite the interface enhancements, our focus this year was 
on system rather than human subject performance, and we simplified the 
experiment design accordingly. From the users 1 perspective, there was only 
one run; the five separate submissions represent variations in 
postprocessing. We spent minimal time preparing the initial queries. Users 
had 20 (instead of last year's 30) minutes for relevance judgments, and 
were allowed to modify the query from the start. This year, as well, we 
reintroduced "vector-length optimization" in the post-processing of 
feedback. Recent CLARITECH systems have augmented the manually generated 
queries with a fixed, arbitrary number of selected terms from top- ranked 
documents . This year, we experimented with a principled truncation of 
the candidate term list, and found this had a positive effect on the 
performance of both of our TREC-7 and TREC-8 final queries. We feel that 
further performance improvements are likely to be achieved only by 
developing several complementary techniques and applying them selectively 
to fine-tune individual queries. User -directed feature selection and 
vector-length optimization are two such promising techniques. (0 Refs) 
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6591460 INSPEC Abstract Number: C2000-06-7250R-030 
Title: Enhancing concept-based retrieval based on minimal term sets 

Author (s): Alsaffar, A.H.; Deogun, J.S.; Raghavan, V.V.; Sever, H. 

Author Affiliation: Dept. of Comput . Sci . & Eng., Nebraska Univ., 
Lincoln, NE, USA 

Journal: Journal of Intelligent Information Systems: Integrating 
Artificial Intelligence and Database Technologies Conference Title: J. 
Intell. Inf. Syst . , Integr. Artif. Intell. Database Technol . (Netherlands) 
vol.14, no. 2-3 p. 155-73 

Publisher: Kluwer Academic Publishers, 

Publication Date: March- June 2000 Country of Publication: Netherlands 
CODEN: JIISEH ISSN: 0925-9902 

SICI: 0925-9902 (200003/06) 14 : 2/3L . 155 : ECBR; 1-G 
Material Identity Number: C318-2000-002 

U.S. Copyright Clearance Center Code: 0925-9902/2000/$18 . 00 

Conference Title: Proceedings of ISMIS'99: 11th International Symposium 

on Methodologies for Intelligent Systems 

Conference Sponsor: ICS PAS; Polish- Japanese Sch. Inf. Technol 
Conference Date: 8-11 June 1999 Conference Location: Warsaw, Poland 
Language: English Document Type: Conference Paper (PA); Journal Paper 

(JP) 

Treatment: Practical (P) 

Abstract: There is considerable interest in bridging the terminological 
gap that exists between the way users prefer to specify their information 
needs and the way queries are expressed in terms of keywords or text 
expressions that occur in documents. One of the approaches proposed for 
bridging this gap is based on technologies for expert systems. The central 
idea of such an approach was introduced in the context of a system called 
Rule Based Information Retrieval by Computer (RUBRIC) . In RUBRIC, user 
query topics (or concepts) are captured in a rule base represented by an 
AND/OR tree. The evaluation of AND/OR tree is essentially based on minimum 
and maximum weights of query terms for conjunctions and disjunctions, 
respectively. The time to generate the retrieval output of AND/OR tree for 
a given query topic is exponential in number of conjunctions in the DNF 
expression associated with the query topic. We propose a new approach for 
computing the retrieval output. The proposed approach involves 
preprocessing of the rule base to generate minimal term sets (MTSs) that 
speed up the retrieval process. The computational complexity of the on-line 
query evaluation following the preprocessing is polynomial in m. We show 
that the computation and use of MTSs allows a user to choose query 
topics that best suit their needs and to use retrieval functions that yield 
a more refined and controlled retrieval output than is possible with the 
AND/OR tree when document terms are binary. We incorporate p-Norm model 
into the process of evaluating MTSs to handle the case where weights of 
both documents and query terms are non-binary. (11 Refs) 
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04004732 INSPEC Abstract Number: C91075373 

Title: The Observer: a software system for collection and analysis of 
observational data 

Author ( s ) : Noldus , L . P . J . J . 

Author Affiliation: Noldus Inf. Technol. bv, Wageningen, Netherlands 



Journal: Behavior Research Methods, Instruments, & Computers vol.23, 
no. 3 p. 415-29 

Publication Date: Aug. 1991 Country of Publication: USA 
CODEN: BRMCEW ISSN: 0743-3808 

U.S. Copyright Clearance Center Code: 0743-3808/91/$ . 50+ . 10 
Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: The Observer is a software package for event recording and data 
analysis in behavioral research. It allows any IBM compatible to serve as 
an event recorder and can generate dedicated event-recording programs for 
several types of non-IBM-compatible portable computers and transfer files. 
The configuration can be either used directly for event recording on the PC 
or passed on to a program generator that creates a program to collect data 
on a hand-held computer. Observational data from either type of computer 
can be analyzed by the program. Event-recording configurations can be 
tailored to many different experimental designs. An online electronic 
notepad permits note taking and the program also includes online error 
correction. User comments as well as independent variables can be stored. 
During data analysis, the user can select the level of analysis and the 
type of output file. The Observer calculates frequency of occurrence and 
duration for classes of events, individual events, or combinations of 
events. For analysis of concurrence, one can select the number of nesting 
levels and the order of nesting. Output can be generated in the form of 
sorted event sequence files , text report files, and tabular ASCII 
files. The results can be exported to spreadsheet and statistical programs. 
(16 Refs) 
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DIALOG (R) File 202: Info. Sci . & Tech. Abs . 
(c) 2003 EBSCO Publishing. All rts. reserv. 

0300188 

Combinations of analysis methods merged output results. 

Book Title: In Harvard University. Computation Laboratory. Information 

Storage And Retrieval. Scientific Report No. Isr-9 To The National Science 

Foundation. 1965 August. Harvard University, Cambridge, Mass., P. Xix-1 To 

Xix-10. 6 Illus. See Da 68-098. 

Author(s): Rocchio, Joseph 

Publication Date: 1965 

Language: English 

Document Type: Book Chapter 

Record Type: Abstract 

Journal Announcement: 0300 

Automatic indexing of documents in the smart system is considered. To 
meet the user's needs, an operational retrieval system may require a 
variety of information analysis techniques. To evaluate the effectiveness 
of individual retrieval methods, their results are compared with the 
combined result obtained after merging different procedures. The merging 
technique used in the smart system is outlined. The order of indexing 
following merging in shown to be dependent upon the sequence in which the 
component methods are considered. Indexing is improved by merging if the 
individual methods are successful in identifying different subsets of the 
relevant set for the input query. In the case where a given method 
dominates others. I.e., retrieves every relevant document with the 
highest rank index, merging fails to improve the quality of indexing. 
Experimental data on document retrieval obtained by queries in the smart 
system for various combinations of methods are presented. Several different 
analysis procedures rather than a single method of indexing should be 
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6471400 INSPEC Abstract Number: C2000-02-7210N-127 
Title: Web document presending based on user behavior patterns 
Author (s): Zhu Pei-Dong; Lu Xi-Cheng; Zhou Xing-Ming 

Author Affiliation: Sch. of Comput . , Nat. Univ. of Defense Technol . , 
Changsha, China 

Journal: Journal of Software vol.10, no. 11 p. 1142-7 
Publisher: Science Press, 

Publication Date: Nov. 1999 Country of Publication: China 

CODEN: RUXUEW ISSN: 1000-9825 

SICI: 1000-9825(199911) 10: 11L. 1142 : DPBU; 1-T 

Material Identity Nuinber: G255-2000-002 

Language: Chinese Document Type: Journal Paper (JP) 

Treatment: Practical (P) 

Abstract: Presending is an active service which extends the caching 
mechanism from temporal locality to spatial locality. Two modes of 
extracting user behavior patterns are proposed to predict future 
requests from clients for efficient presending. The URL-based mode exploits 
the Markov-chain features of the request series, and can be used for 
hierarchical presending. The session-based mode captures more semantics, 
and the authors 1 work emphasizes the clustering algorithm, feasible 
document weight definition, and attribute- vector -distance computation 
representing order of accesses. Their performance is evaluated using 



appropriate metrics such as request hit rate, session hit rate, presending 
efficiency and presending cost. Numerous experiments are carried out to 
compare the two modes. These methods are used for Web presending, while 
they are helpful to Web server design and ISP (Internet service provider) 
service planning. (11 Refs) 
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00107392 DOCUMENT TYPE: Review 

PRODUCT NAMES: Intranet DrawingSearcher 1.0 Windows 95 & NT (693383) 

TITLE : DrawingSearcher 

AUTHOR: Villegas, Dean 

SOURCE: CADalyst, vl5 n3 p43(4) Mar 1998 
ISSN: 0820-5450 

HOMEPAGE : ht tp : / /www . cadonline . com 

RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE : A 

DocuPoint's Intranet DrawingSearcher 1.0 is a type of search engine 
dedicated to finding drawings on a network, including AutoCAD drawings. 
Intranet DrawingSearcher 1.0 uses the AltaVista search engine to find 
AutoCAD drawings and documents in over 100 popular PC file formats, 
including WordPerfect's, Word's, Excel ! s, and Adobe Acrobat's. Users search 
using any text, block name, or attribute value in the drawing. A list of 
files is returned with optional title block information. The indexer 
processes drawings from AutoCAD 2.6 to 14 releases, but the viewer operates 
only with AutoCAD 12 through 14. If the drawing title blocks use 
attributes, users can view the same title block data for each drawing 
in what appears to be a database record table in the search results page. 
Intranet DrawingSearcher has an advanced Search button that allows users to 
merge words or phrases using parentheses; the user then enters a start or 
end date, and ranking words to allow documents that contain the words 
selected to be shown first. Many document management programs require users 
to type file descriptions into a database or electronic card catalog before 
they can retrieve files. Intranet DrawingSearcher 1.0 does not require such 
document registration; it automatically indexes the network or intranet 
file's word content into a database. 
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